A User-assisted Approach to Multiple Instrument Music Transcription

Author

  • Holger Kirchhoff
Abstract

The task of automatic music transcription has been studied for several decades and is regarded as an enabling technology for a multitude of applications such as music retrieval and discovery, intelligent music processing and large-scale musicological analyses. It refers to the process of identifying the musical content of a performance and representing it in a symbolic format. Despite its long research history, fully automatic music transcription systems are still error-prone and often fail when more complex polyphonic music is analysed. This raises the question of how human knowledge can be incorporated into the transcription process.

This thesis investigates ways to involve a human user in the transcription process. More specifically, it investigates how user input can be used to derive timbre models for the instruments in a music recording, which are then employed to obtain instrument-specific (parts-based) transcriptions.

A first investigation studies different types of user input for deriving instrument models by means of a non-negative matrix factorisation framework. The transcription accuracy of the different models is evaluated, and a method is proposed that refines the models by allowing each pitch of each instrument to be represented by multiple basis functions.

A second study aims at limiting the amount of user input to make the method more applicable in practice. Different methods are considered for estimating missing non-negative basis functions when only a subset of basis functions can be extracted from the user information.

A method is proposed to track the pitches of individual instruments over time by means of a Viterbi framework in which the states at each time frame contain several candidate instrument-pitch combinations. A transition probability is employed that combines three different criteria: the frame-wise reconstruction error of each combination, a pitch continuity measure that favours similar pitches in consecutive frames, and an explicit activity model for each instrument. The method is shown to outperform other state-of-the-art multi-instrument tracking methods.

Finally, the extraction of instrument models that include phase information is investigated as a step towards complex matrix decomposition. The phase relations between the partials of harmonic sounds are explored as a time-invariant property that can be employed to form complex-valued basis functions. The application of the model to a user-assisted transcription task is illustrated with a saxophone example.
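The tracking idea described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: the abstract does not give the actual formulation, so the additive cost (standing in for a negative log transition probability), the weights `w_pitch` and `w_act`, the `INACTIVE` marker, and the function names `transition_cost` and `track` are all assumptions made for illustration. The reconstruction errors are taken as given inputs rather than computed from a spectrogram.

```python
from typing import List, Sequence, Tuple

INACTIVE = -1  # hypothetical marker: the instrument is silent in this frame
Candidate = Tuple[int, ...]  # one MIDI pitch (or INACTIVE) per instrument


def transition_cost(prev: Candidate, cur: Candidate, recon_err: float,
                    w_pitch: float = 0.1, w_act: float = 1.0) -> float:
    """Combine the three criteria into one additive cost (lower is better)."""
    pitch_jump = 0.0
    activity_changes = 0
    for p, c in zip(prev, cur):
        if (p == INACTIVE) != (c == INACTIVE):
            activity_changes += 1      # instrument switched on or off
        elif p != INACTIVE:
            pitch_jump += abs(p - c)   # pitch continuity, in semitones
    return recon_err + w_pitch * pitch_jump + w_act * activity_changes


def track(candidates: Sequence[Sequence[Candidate]],
          recon_errors: Sequence[Sequence[float]],
          w_pitch: float = 0.1, w_act: float = 1.0) -> List[Candidate]:
    """Viterbi search for the minimum-cost sequence of candidates."""
    # Cumulative cost of the best path ending in each candidate of frame 0.
    cost = list(recon_errors[0])
    backptr: List[List[int]] = []
    for t in range(1, len(candidates)):
        new_cost, ptr = [], []
        for k, cur in enumerate(candidates[t]):
            options = [
                cost[j] + transition_cost(prev, cur, recon_errors[t][k],
                                          w_pitch, w_act)
                for j, prev in enumerate(candidates[t - 1])
            ]
            best = min(range(len(options)), key=options.__getitem__)
            new_cost.append(options[best])
            ptr.append(best)
        cost = new_cost
        backptr.append(ptr)
    # Backtrack from the cheapest final state.
    k = min(range(len(cost)), key=cost.__getitem__)
    path = [k]
    for ptr in reversed(backptr):
        k = ptr[k]
        path.append(k)
    path.reverse()
    return [candidates[t][k] for t, k in enumerate(path)]


# Toy input: two instruments, three frames, two candidates per frame.
cands = [
    [(60, 48), (60, INACTIVE)],
    [(72, 48), (61, 48)],
    [(62, 48), (62, INACTIVE)],
]
errs = [
    [0.1, 0.5],
    [0.2, 0.25],
    [0.1, 0.1],
]
best_path = track(cands, errs)
print(best_path)
```

In this toy input the candidate `(72, 48)` has the lowest reconstruction error in the middle frame, but the pitch continuity term penalises the octave jump, so the search can prefer a smoother path. Setting `w_pitch=0.0` removes that penalty and lets the reconstruction error dominate.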


Similar resources

Specific Music Transcription for Tutoring

An application-specific music transcription approach uses a customized human-computer interface to combine the strengths of humans and computers to enhance music transcription through instrument modeling and multimedia fusion. Automatic music transcription (AMT) refers to the ability of computers to write note information, such as the pitch, onset time, duration, and source of each sound, after...


Transcribing Multi-instrument Polyphonic Music with Transformed Eigeninstrument Whole-note Templates

We present a system for the transcription of polyphonic music recordings to recover both the notes played and the instruments responsible for each note. In our framework, the spectrogram of the music is viewed as the superposition of note events, each characterized by an onset time and pitch, an instrument (described by a vector of eigeninstrument weights that combine instrument model bases to ...


Musical Instrument Extraction through Timbre Classification

Advances in the internet and online servers have made many musical pieces readily available for users to enjoy. Users may listen to the music, share it with friends, or create another musical piece by remixing or sampling. One may wish simply to play the music as it is or to sample just one instrument from it; however, this task can be challenging...


A supervised approach for rhythm transcription based on tree series enumeration

We present a rhythm transcription system integrated in the computer-assisted composition environment OpenMusic. Rhythm transcription consists in translating a series of dated events into traditional music notation’s pulsed and structured representation. As transcription is equivocal, our system favors interactions with the user to reach a satisfactory compromise between various criteria, in par...


Improving Instrument Recognition in Polyphonic Music through System Integration (City Research Online)

A method is proposed for instrument recognition in polyphonic music which combines two independent detector systems: a polyphonic musical instrument recognition system using a missing feature approach, and an automatic music transcription system based on shift-invariant probabilistic latent component analysis that includes instrument assignment. We propose a method to integrate the two systems b...




Publication date: 2014